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We report the genome sequences of two Streptococcus thermophilus strains, TH1435 and TH1436, isolated from raw goat milk 
devoted to the production of artisanal cheese in the FriuU-Venezia Giulia region in Italy. The genome sequences of these two 
quickly acidifying strains are the first available genome sequences of S. thermophilus strains isolated in Italy. 
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Streptococcus thermophilus is a homofermentative thermophilic 
lactic acid bacterium (LAB) used worldwide as a starter culture 
for the manufacture of a variety of fermented dairy products (1). 
To date, the complete genomes at the chromosome level of S. ther- 
mophilus strains CNRZ1066 (2), JIM8232 (3), LMD-9 (4), 
LMG18311 (2),MN-ZLW-002(5),andND03 (6) have been pub- 
lished. However, S. thermophilus TH1435 and TH1436 represent 
the first cases of S. thermophilus strains isolated in Italy and the 
only ones from goat milk. 

S. thermophilus TH1435 and TH1436, collected from two al- 
pine huts (malghe) from raw goat milk used for the artisanal pro- 
duction of Italian cheese, are capable of rapidly lowering the pH of 
milk, which represents a key technological feature for the dairy 
industry. Moreover, strain TH1436 is capable of utilizing galac- 
tose, while TH1435 lacks this feature. 

Whole-genome sequencing of strains TH1435 and TH1436 
was performed with an lUumina MiSeq sequencer at the Rama- 
ciotti Centre, Sydney, Australia. Genomic libraries were prepared 
using the Nextera XT kit lUumina (lUumina, Inc., San Diego, CA), 
which produced a mean insert size between 800 and 1,200 bp. A 
total of 776,373 and 1,527,857 paired-end reads (2 X 250 bp) were 
generated and gave 134- and 159-fold coverages of the TH1435 
and TH1436 genomes, respectively. Approximately 85% of these 
reads were assembled into 36 and 28 large scaffolds, respectively, 
using a manually curated consensus of assemblies obtained using 
version 1.2.10 of the Velvet software (7) and version 2.8 of the 454 
Newbler Assembler (454 Life Sciences, Branford, CT). The draft 
genome of S. thermophilus TH1435 is a single circular chromo- 
some of 1,750,348 bases in length, with a mean G+C content of 
38.9% and two putative plasmids individuated by BLAST match 
(scaffolds 33 and 35). The draft genome of strain TH1436 is a 
single circular chromosome of 1,780,473 bases, with a mean G+C 
content of 39.0%; no plasmid sequences were detected for this 
strain. 

Genome annotation was performed using RAST (8) and by the 
NCBI Prokaryotic Genomes Annotation Pipeline (9). A total of 
1,925 coding sequences (CDSs) and 47 structural RNAs were pre- 



dicted in strain TH1435, while 1,899 CDSs and 48 RNAs were 
found in strain TH1436. Additionally, several phage-associated 
sequences were identified, 12 genes related to transposases and 7 
clusters of regularly interspaced short palindromic repeats 
(CRISPRs) in both strains. 

A comparison against the complete genome of S. thermophilus 
CNRZ1066 revealed that TH1435 and TH1436 contain 38 com- 
mon predicted CDSs that are absent in the reference strain. Fur- 
thermore, 31 CDSs specific to TH1435 and 18 specific to TH1436 
are present. The sequences of particular interest include a unique 
gene cluster for the biosynthesis of bacteriocins, resistance to an- 
tibiotics, and toxic compounds, as well as several gene clusters that 
indicate that TH1435 harbors at least one prophage. 

Nucleotide sequence accession numbers. The data from these 
whole-genome shotgun projects were deposited in GenBank un- 
der the accession no. AYSGOOOOOOOO for S. thermophilus TH1435 
and AYTTOOOOOOOO for S. thermophilus TH1436. The versions 
described in this paper are versions AYSGOIOOOOOO and 
AYTTOIOOOOOO, respectively. 
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